Dataset statistics
| Number of variables | 21 |
|---|---|
| Number of observations | 179078 |
| Missing cells | 513118 |
| Missing cells (%) | 13.6% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 28.7 MiB |
| Average record size in memory | 168.0 B |
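The derived figures in this table follow from the raw counts. The short sketch below uses only the numbers reported above (the variable names are illustrative) to recompute the missing-cell percentage and the average record size. The total size is itself rounded to one decimal, so the record size is only reproduced approximately.

```python
# Values copied from the "Dataset statistics" table.
n_variables = 21
n_observations = 179078
missing_cells = 513118
total_size_mib = 28.7  # already rounded in the report

# Every observation has one cell per variable.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# 1 MiB = 1024 * 1024 bytes.
avg_record_bytes = total_size_mib * 1024 * 1024 / n_observations

print(f"total cells: {total_cells}")
print(f"missing cells: {missing_pct:.1f}%")
print(f"average record size: about {avg_record_bytes:.0f} B")
```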
Variable types
| Numeric | 8 |
|---|---|
| Categorical | 13 |
Alerts
| Alert | Type |
|---|---|
| batsman has a high cardinality: 516 distinct values | High cardinality |
| non_striker has a high cardinality: 511 distinct values | High cardinality |
| bowler has a high cardinality: 405 distinct values | High cardinality |
| player_dismissed has a high cardinality: 487 distinct values | High cardinality |
| fielder has a high cardinality: 499 distinct values | High cardinality |
| wide_runs is highly correlated with extra_runs | High correlation |
| legbye_runs is highly correlated with extra_runs | High correlation |
| batsman_runs is highly correlated with total_runs | High correlation |
| extra_runs is highly correlated with wide_runs and 1 other field | High correlation |
| total_runs is highly correlated with batsman_runs | High correlation |
| is_super_over is highly correlated with inning | High correlation |
| bye_runs is highly correlated with dismissal_kind | High correlation |
| penalty_runs is highly correlated with dismissal_kind | High correlation |
| inning is highly correlated with is_super_over | High correlation |
| dismissal_kind is highly correlated with bye_runs and 1 other field | High correlation |
| wide_runs is highly correlated with extra_runs and 1 other field | High correlation |
| bye_runs is highly correlated with extra_runs | High correlation |
| penalty_runs is highly correlated with extra_runs | High correlation |
| batsman_runs is highly correlated with total_runs and 1 other field | High correlation |
| extra_runs is highly correlated with wide_runs and 4 other fields | High correlation |
| total_runs is highly correlated with wide_runs and 3 other fields | High correlation |
| dismissal_kind is highly correlated with batsman_runs and 1 other field | High correlation |
| player_dismissed has 170244 (95.1%) missing values | Missing |
| dismissal_kind has 170244 (95.1%) missing values | Missing |
| fielder has 172630 (96.4%) missing values | Missing |
| wide_runs has 173673 (97.0%) zeros | Zeros |
| legbye_runs has 176141 (98.4%) zeros | Zeros |
| batsman_runs has 70845 (39.6%) zeros | Zeros |
| extra_runs has 169541 (94.7%) zeros | Zeros |
| total_runs has 63002 (35.2%) zeros | Zeros |
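The percentages in the Missing and Zeros alerts are each count divided by the 179078 observations. The sketch below recomputes them from the counts quoted above. It also checks that the three columns with missing values account for all 513118 missing cells reported in the dataset statistics. The helper name `share` is illustrative and not part of the report.

```python
n_observations = 179078

# Counts copied from the Missing and Zeros alerts.
missing = {
    "player_dismissed": 170244,
    "dismissal_kind": 170244,
    "fielder": 172630,
}
zeros = {
    "wide_runs": 173673,
    "legbye_runs": 176141,
    "batsman_runs": 70845,
    "extra_runs": 169541,
    "total_runs": 63002,
}


def share(count, total=n_observations):
    """Return count as a percentage of total, rounded to one decimal."""
    return round(100 * count / total, 1)


missing_pct = {name: share(count) for name, count in missing.items()}
zeros_pct = {name: share(count) for name, count in zeros.items()}
total_missing = sum(missing.values())

print(missing_pct)
print(zeros_pct)
print("missing cells across all columns:", total_missing)
```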
Reproduction
| Analysis started | 2021-11-02 07:41:54.568008 |
|---|---|
| Analysis finished | 2021-11-02 07:42:39.701862 |
| Duration | 45.13 seconds |
| Software version | pandas-profiling v3.1.0 |
| Download configuration | config.json |
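The reported duration is the difference between the two timestamps. As a small check, the sketch below parses both values with the standard library and recomputes it. The format string is an assumption about how the timestamps are laid out, based on how they appear above.

```python
from datetime import datetime

# Assumed layout of the timestamps shown in the Reproduction table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2021-11-02 07:41:54.568008", FMT)
finished = datetime.strptime("2021-11-02 07:42:39.701862", FMT)

duration_s = (finished - started).total_seconds()
print(f"{duration_s:.2f} seconds")
```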
match_id
Real number (ℝ≥0)
| Distinct | 756 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1802.252957 |
| Minimum | 1 |
| Maximum | 11415 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 38 |
| Q1 | 190 |
| median | 379 |
| Q3 | 567 |
| 95-th percentile | 11314 |
| Maximum | 11415 |
| Range | 11414 |
| Interquartile range (IQR) | 377 |
Descriptive statistics
| Standard deviation | 3472.322805 |
|---|---|
| Coefficient of variation (CV) | 1.926656739 |
| Kurtosis | 2.245787145 |
| Mean | 1802.252957 |
| Median Absolute Deviation (MAD) | 188 |
| Skewness | 1.996380528 |
| Sum | 322743855 |
| Variance | 12057025.66 |
| Monotonicity | Increasing |
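Several of the match_id statistics are derived from others in the same tables. The sketch below recomputes the range, interquartile range, coefficient of variation, variance and mean from the reported values. The inputs are rounded in the report, so comparisons with the printed figures should allow a small tolerance.

```python
# Values copied from the match_id tables above.
n_observations = 179078
minimum, maximum = 1, 11415
q1, q3 = 190, 567
mean = 1802.252957
std = 3472.322805
total = 322743855  # the "Sum" row

value_range = maximum - minimum   # Range
iqr = q3 - q1                     # Interquartile range
cv = std / mean                   # Coefficient of variation
variance = std ** 2               # Variance
mean_from_sum = total / n_observations

print("range:", value_range)
print("IQR:", iqr)
print(f"CV: {cv:.6f}")
print(f"variance: {variance:.2f}")
print(f"mean from sum: {mean_from_sum:.6f}")
```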
| Value | Count | Frequency (%) |
| 126 | 267 | 0.1% |
| 34 | 263 | 0.1% |
| 534 | 262 | 0.1% |
| 476 | 262 | 0.1% |
| 388 | 261 | 0.1% |
| 190 | 259 | 0.1% |
| 570 | 259 | 0.1% |
| 536 | 258 | 0.1% |
| 401 | 258 | 0.1% |
| 257 | 257 | 0.1% |
| Other values (746) | 176472 |
| Value | Count | Frequency (%) |
| 1 | 248 | |
| 2 | 247 | |
| 3 | 218 | |
| 4 | 247 | |
| 5 | 248 | |
| 6 | 216 | |
| 7 | 254 | |
| 8 | 212 | |
| 9 | 226 | |
| 10 | 239 |
| Value | Count | Frequency (%) |
| 11415 | 248 | |
| 11414 | 239 | |
| 11413 | 252 | |
| 11412 | 237 | |
| 11347 | 228 | |
| 11346 | 235 | |
| 11345 | 246 | |
| 11344 | 224 | |
| 11343 | 234 | |
| 11342 | 249 |
inning
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |

| Value | Count |
|---|---|
| 1 | 92742 |
| 2 | 86240 |
| 3 | 50 |
| 4 | 38 |
| 5 | 8 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 1 |
| 3rd row | 1 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
| 1 | 92742 | |
| 2 | 86240 | |
| 3 | 50 | < 0.1% |
| 4 | 38 | < 0.1% |
| 5 | 8 | < 0.1% |
batting_team
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |

| Value | Count |
|---|---|
| Mumbai Indians | 22619 |
| Kings XI Punjab | 20931 |
| Royal Challengers Bangalore | 20908 |
| Kolkata Knight Riders | 20858 |
| Chennai Super Kings | 19762 |
| Other values (9) | 74000 |
Length
| Max length | 27 |
|---|---|
| Median length | 16 |
| Mean length | 17.99314265 |
| Min length | 13 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Sunrisers Hyderabad |
|---|---|
| 2nd row | Sunrisers Hyderabad |
| 3rd row | Sunrisers Hyderabad |
| 4th row | Sunrisers Hyderabad |
| 5th row | Sunrisers Hyderabad |
Common Values
| Value | Count | Frequency (%) |
| Mumbai Indians | 22619 | |
| Kings XI Punjab | 20931 | |
| Royal Challengers Bangalore | 20908 | |
| Kolkata Knight Riders | 20858 | |
| Chennai Super Kings | 19762 | |
| Delhi Daredevils | 18786 | |
| Rajasthan Royals | 17292 | |
| Sunrisers Hyderabad | 12908 | |
| Deccan Chargers | 9034 | 5.0% |
| Pune Warriors | 5443 | 3.0% |
| Other values (4) | 10537 |
Words
| Value | Count | Frequency (%) |
| kings | 40693 | 9.1% |
| mumbai | 22619 | 5.1% |
| indians | 22619 | 5.1% |
| xi | 20931 | 4.7% |
| punjab | 20931 | 4.7% |
| royal | 20908 | 4.7% |
| challengers | 20908 | 4.7% |
| bangalore | 20908 | 4.7% |
| kolkata | 20858 | 4.7% |
| knight | 20858 | 4.7% |
| Other values (21) | 213444 |
bowling_team
Categorical
| Distinct | 14 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |

| Value | Count |
|---|---|
| Mumbai Indians | 22517 |
| Royal Challengers Bangalore | 21236 |
| Kolkata Knight Riders | 20940 |
| Kings XI Punjab | 20782 |
| Chennai Super Kings | 19556 |
| Other values (9) | 74047 |
Length
| Max length | 27 |
|---|---|
| Median length | 16 |
| Mean length | 18.01460258 |
| Min length | 13 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Royal Challengers Bangalore |
|---|---|
| 2nd row | Royal Challengers Bangalore |
| 3rd row | Royal Challengers Bangalore |
| 4th row | Royal Challengers Bangalore |
| 5th row | Royal Challengers Bangalore |
Common Values
| Value | Count | Frequency (%) |
| Mumbai Indians | 22517 | |
| Royal Challengers Bangalore | 21236 | |
| Kolkata Knight Riders | 20940 | |
| Kings XI Punjab | 20782 | |
| Chennai Super Kings | 19556 | |
| Delhi Daredevils | 18725 | |
| Rajasthan Royals | 17382 | |
| Sunrisers Hyderabad | 12779 | |
| Deccan Chargers | 9039 | |
| Pune Warriors | 5457 | 3.0% |
| Other values (4) | 10665 |
Words
| Value | Count | Frequency (%) |
| kings | 40338 | 9.0% |
| mumbai | 22517 | 5.1% |
| indians | 22517 | 5.1% |
| royal | 21236 | 4.8% |
| challengers | 21236 | 4.8% |
| bangalore | 21236 | 4.8% |
| kolkata | 20940 | 4.7% |
| knight | 20940 | 4.7% |
| riders | 20940 | 4.7% |
| xi | 20782 | 4.7% |
| Other values (21) | 213145 |
over
Real number (ℝ≥0)
| Distinct | 20 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.16248785 |
| Minimum | 1 |
| Maximum | 20 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 5 |
| median | 10 |
| Q3 | 15 |
| 95-th percentile | 19 |
| Maximum | 20 |
| Range | 19 |
| Interquartile range (IQR) | 10 |
Descriptive statistics
| Standard deviation | 5.677684313 |
|---|---|
| Coefficient of variation (CV) | 0.558690391 |
| Kurtosis | -1.183356367 |
| Mean | 10.16248785 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | 0.04901758304 |
| Sum | 1819878 |
| Variance | 32.23609916 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 9603 | 5.4% |
| 2 | 9498 | 5.3% |
| 3 | 9415 | 5.3% |
| 4 | 9379 | 5.2% |
| 5 | 9345 | 5.2% |
| 6 | 9326 | 5.2% |
| 7 | 9283 | 5.2% |
| 8 | 9253 | 5.2% |
| 9 | 9231 | 5.2% |
| 10 | 9184 | 5.1% |
| Other values (10) | 85561 |
| Value | Count | Frequency (%) |
| 1 | 9603 | |
| 2 | 9498 | |
| 3 | 9415 | |
| 4 | 9379 | |
| 5 | 9345 | |
| 6 | 9326 | |
| 7 | 9283 | |
| 8 | 9253 | |
| 9 | 9231 | |
| 10 | 9184 |
| Value | Count | Frequency (%) |
| 20 | 6738 | |
| 19 | 7866 | |
| 18 | 8387 | |
| 17 | 8648 | |
| 16 | 8761 | |
| 15 | 8900 | |
| 14 | 8978 | |
| 13 | 9073 | |
| 12 | 9090 | |
| 11 | 9120 |
ball
Real number (ℝ≥0)
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.615586504 |
| Minimum | 1 |
| Maximum | 9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 2 |
| median | 4 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 9 |
| Range | 8 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 1.806965975 |
|---|---|
| Coefficient of variation (CV) | 0.4997711915 |
| Kurtosis | -1.083107949 |
| Mean | 3.615586504 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | 0.09612230007 |
| Sum | 647472 |
| Variance | 3.265126035 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 29047 | |
| 2 | 28963 | |
| 3 | 28878 | |
| 4 | 28812 | |
| 5 | 28720 | |
| 6 | 28628 | |
| 7 | 5113 | 2.9% |
| 8 | 795 | 0.4% |
| 9 | 122 | 0.1% |
| Value | Count | Frequency (%) |
| 9 | 122 | 0.1% |
| 8 | 795 | 0.4% |
| 7 | 5113 | 2.9% |
| 6 | 28628 | |
| 5 | 28720 | |
| 4 | 28812 | |
| 3 | 28878 | |
| 2 | 28963 | |
| 1 | 29047 |
batsman
Categorical
| Distinct | 516 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |

| Value | Count |
|---|---|
| V Kohli | 4211 |
| SK Raina | 4044 |
| RG Sharma | 3816 |
| S Dhawan | 3776 |
| G Gambhir | 3524 |
| Other values (511) | 159707 |
Length
| Max length | 20 |
|---|---|
| Median length | 9 |
| Mean length | 9.318967154 |
| Min length | 5 |
Unique
| Unique | 13 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | DA Warner |
|---|---|
| 2nd row | DA Warner |
| 3rd row | DA Warner |
| 4th row | DA Warner |
| 5th row | DA Warner |
Common Values
| Value | Count | Frequency (%) |
| V Kohli | 4211 | 2.4% |
| SK Raina | 4044 | 2.3% |
| RG Sharma | 3816 | 2.1% |
| S Dhawan | 3776 | 2.1% |
| G Gambhir | 3524 | 2.0% |
| RV Uthappa | 3492 | 1.9% |
| DA Warner | 3398 | 1.9% |
| MS Dhoni | 3318 | 1.9% |
| AM Rahane | 3215 | 1.8% |
| CH Gayle | 3131 | 1.7% |
| Other values (506) | 143153 |
Words
| Value | Count | Frequency (%) |
| s | 6778 | 1.8% |
| v | 6474 | 1.8% |
| singh | 4936 | 1.3% |
| da | 4774 | 1.3% |
| sr | 4683 | 1.3% |
| sharma | 4675 | 1.3% |
| m | 4395 | 1.2% |
| de | 4367 | 1.2% |
| sk | 4324 | 1.2% |
| kohli | 4231 | 1.2% |
| Other values (686) | 317432 |
non_striker
Categorical
| Distinct | 511 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |

| Value | Count |
|---|---|
| SK Raina | 4173 |
| S Dhawan | 4090 |
| V Kohli | 4071 |
| RG Sharma | 3858 |
| G Gambhir | 3740 |
| Other values (506) | 159146 |
Length
| Max length | 20 |
|---|---|
| Median length | 9 |
| Mean length | 9.320647986 |
| Min length | 5 |
Unique
| Unique | 4 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | S Dhawan |
|---|---|
| 2nd row | S Dhawan |
| 3rd row | S Dhawan |
| 4th row | S Dhawan |
| 5th row | S Dhawan |
Common Values
| Value | Count | Frequency (%) |
| SK Raina | 4173 | 2.3% |
| S Dhawan | 4090 | 2.3% |
| V Kohli | 4071 | 2.3% |
| RG Sharma | 3858 | 2.2% |
| G Gambhir | 3740 | 2.1% |
| AM Rahane | 3467 | 1.9% |
| RV Uthappa | 3381 | 1.9% |
| DA Warner | 3127 | 1.7% |
| CH Gayle | 3023 | 1.7% |
| AB de Villiers | 2996 | 1.7% |
| Other values (501) | 143152 |
Words
| Value | Count | Frequency (%) |
| s | 7043 | 1.9% |
| v | 6487 | 1.8% |
| sr | 4897 | 1.3% |
| sharma | 4806 | 1.3% |
| singh | 4695 | 1.3% |
| m | 4518 | 1.2% |
| da | 4491 | 1.2% |
| sk | 4423 | 1.2% |
| de | 4315 | 1.2% |
| dhawan | 4209 | 1.1% |
| Other values (684) | 317210 |
bowler
Categorical
| Distinct | 405 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |

| Value | Count |
|---|---|
| Harbhajan Singh | 3451 |
| A Mishra | 3172 |
| PP Chawla | 3157 |
| R Ashwin | 3016 |
| SL Malinga | 2974 |
| Other values (400) | 163308 |
Length
| Max length | 17 |
|---|---|
| Median length | 9 |
| Mean length | 9.464836552 |
| Min length | 5 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | TS Mills |
|---|---|
| 2nd row | TS Mills |
| 3rd row | TS Mills |
| 4th row | TS Mills |
| 5th row | TS Mills |
Common Values
| Value | Count | Frequency (%) |
| Harbhajan Singh | 3451 | 1.9% |
| A Mishra | 3172 | 1.8% |
| PP Chawla | 3157 | 1.8% |
| R Ashwin | 3016 | 1.7% |
| SL Malinga | 2974 | 1.7% |
| DJ Bravo | 2711 | 1.5% |
| B Kumar | 2707 | 1.5% |
| P Kumar | 2637 | 1.5% |
| UT Yadav | 2605 | 1.5% |
| SP Narine | 2600 | 1.5% |
| Other values (395) | 150048 |
Words
| Value | Count | Frequency (%) |
| r | 9707 | 2.7% |
| singh | 9243 | 2.5% |
| sharma | 9188 | 2.5% |
| a | 8586 | 2.4% |
| kumar | 7561 | 2.1% |
| s | 6896 | 1.9% |
| m | 6348 | 1.7% |
| p | 5150 | 1.4% |
| pp | 5102 | 1.4% |
| b | 4200 | 1.2% |
| Other values (559) | 292640 |
is_super_over
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |

| Value | Count |
|---|---|
| 0 | 178997 |
| 1 | 81 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 178997 | |
| 1 | 81 | < 0.1% |
wide_runs
Real number (ℝ≥0)
High correlation, Zeros
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.03672142865 |
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 173673 |
| Zeros (%) | 97.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.2511611312 |
|---|---|
| Coefficient of variation (CV) | 6.839633981 |
| Kurtosis | 191.6858792 |
| Mean | 0.03672142865 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 11.66307776 |
| Sum | 6576 |
| Variance | 0.06308191384 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 173673 | |
| 1 | 4915 | 2.7% |
| 2 | 230 | 0.1% |
| 5 | 208 | 0.1% |
| 3 | 47 | < 0.1% |
| 4 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 173673 | |
| 1 | 4915 | 2.7% |
| 2 | 230 | 0.1% |
| 3 | 47 | < 0.1% |
| 4 | 5 | < 0.1% |
| 5 | 208 | 0.1% |
| Value | Count | Frequency (%) |
| 5 | 208 | 0.1% |
| 4 | 5 | < 0.1% |
| 3 | 47 | < 0.1% |
| 2 | 230 | 0.1% |
| 1 | 4915 | 2.7% |
| 0 | 173673 |
bye_runs
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |

| Value | Count |
|---|---|
| 0 | 178598 |
| 1 | 324 |
| 4 | 123 |
| 2 | 31 |
| 3 | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 178598 | |
| 1 | 324 | 0.2% |
| 4 | 123 | 0.1% |
| 2 | 31 | < 0.1% |
| 3 | 2 | < 0.1% |
legbye_runs
Real number (ℝ≥0)
High correlation, Zeros
| Distinct | 6 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.02113604128 |
| Minimum | 0 |
| Maximum | 5 |
| Zeros | 176141 |
| Zeros (%) | 98.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0 |
| Maximum | 5 |
| Range | 5 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.1949082998 |
|---|---|
| Coefficient of variation (CV) | 9.221608588 |
| Kurtosis | 242.3265243 |
| Mean | 0.02113604128 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 13.77728696 |
| Sum | 3785 |
| Variance | 0.03798924532 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 176141 | |
| 1 | 2558 | 1.4% |
| 4 | 220 | 0.1% |
| 2 | 138 | 0.1% |
| 3 | 17 | < 0.1% |
| 5 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 176141 | |
| 1 | 2558 | 1.4% |
| 2 | 138 | 0.1% |
| 3 | 17 | < 0.1% |
| 4 | 220 | 0.1% |
| 5 | 4 | < 0.1% |
| Value | Count | Frequency (%) |
| 5 | 4 | < 0.1% |
| 4 | 220 | 0.1% |
| 3 | 17 | < 0.1% |
| 2 | 138 | 0.1% |
| 1 | 2558 | 1.4% |
| 0 | 176141 |
noball_runs
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |

| Value | Count |
|---|---|
| 0 | 178364 |
| 1 | 698 |
| 2 | 9 |
| 5 | 6 |
| 3 | 1 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 178364 | |
| 1 | 698 | 0.4% |
| 2 | 9 | < 0.1% |
| 5 | 6 | < 0.1% |
| 3 | 1 | < 0.1% |
penalty_runs
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 1.4 MiB |

| Value | Count |
|---|---|
| 0 | 179076 |
| 5 | 2 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 179076 | |
| 5 | 2 | < 0.1% |
batsman_runs
Real number (ℝ≥0)
High correlation, Zeros
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.246864495 |
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 70845 |
| Zeros (%) | 39.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 4 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.608270266 |
|---|---|
| Coefficient of variation (CV) | 1.289851682 |
| Kurtosis | 1.632692902 |
| Mean | 1.246864495 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.582522721 |
| Sum | 223286 |
| Variance | 2.586533248 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 70845 | |
| 1 | 67523 | |
| 4 | 20392 | 11.4% |
| 2 | 11471 | 6.4% |
| 6 | 8170 | 4.6% |
| 3 | 587 | 0.3% |
| 5 | 79 | < 0.1% |
| 7 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 70845 | |
| 1 | 67523 | |
| 2 | 11471 | 6.4% |
| 3 | 587 | 0.3% |
| 4 | 20392 | 11.4% |
| 5 | 79 | < 0.1% |
| 6 | 8170 | 4.6% |
| 7 | 11 | < 0.1% |
| Value | Count | Frequency (%) |
| 7 | 11 | < 0.1% |
| 6 | 8170 | 4.6% |
| 5 | 79 | < 0.1% |
| 4 | 20392 | 11.4% |
| 3 | 587 | 0.3% |
| 2 | 11471 | 6.4% |
| 1 | 67523 | |
| 0 | 70845 |
extra_runs
Real number (ℝ≥0)
High correlation, Zeros
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.06703224293 |
| Minimum | 0 |
| Maximum | 7 |
| Zeros | 169541 |
| Zeros (%) | 94.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 1 |
| Maximum | 7 |
| Range | 7 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.3425529326 |
|---|---|
| Coefficient of variation (CV) | 5.110271082 |
| Kurtosis | 91.227968 |
| Mean | 0.06703224293 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 8.234162663 |
| Sum | 12004 |
| Variance | 0.1173425116 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 169541 | |
| 1 | 8495 | 4.7% |
| 2 | 407 | 0.2% |
| 4 | 348 | 0.2% |
| 5 | 219 | 0.1% |
| 3 | 67 | < 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 169541 | |
| 1 | 8495 | 4.7% |
| 2 | 407 | 0.2% |
| 3 | 67 | < 0.1% |
| 4 | 348 | 0.2% |
| 5 | 219 | 0.1% |
| 7 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 7 | 1 | < 0.1% |
| 5 | 219 | 0.1% |
| 4 | 348 | 0.2% |
| 3 | 67 | < 0.1% |
| 2 | 407 | 0.2% |
| 1 | 8495 | 4.7% |
| 0 | 169541 |
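The extra_runs sum of 12004 can be cross-checked against the five extras columns. The sketch below rebuilds each column's sum from its value counts and adds them up. It rests on two assumptions that the report does not state: that extra_runs is the per-ball total of wide, bye, leg-bye, no-ball and penalty runs, and that the two unlabelled categorical blocks in this report (values 0 to 4, and values 0 and 5) are bye_runs and penalty_runs.

```python
# Value -> count tables copied from the report.
# Assumption: the unlabelled blocks are bye_runs and penalty_runs.
value_counts = {
    "wide_runs": {0: 173673, 1: 4915, 2: 230, 3: 47, 4: 5, 5: 208},
    "bye_runs": {0: 178598, 1: 324, 2: 31, 3: 2, 4: 123},
    "legbye_runs": {0: 176141, 1: 2558, 2: 138, 3: 17, 4: 220, 5: 4},
    "noball_runs": {0: 178364, 1: 698, 2: 9, 3: 1, 5: 6},
    "penalty_runs": {0: 179076, 5: 2},
}


def column_sum(counts):
    """Sum of a column, reconstructed from its value counts."""
    return sum(value * count for value, count in counts.items())


sums = {name: column_sum(counts) for name, counts in value_counts.items()}
rows = {name: sum(counts.values()) for name, counts in value_counts.items()}
total_extras = sum(sums.values())

print(sums)
print("sum of all extras columns:", total_extras)
```

Under these assumptions the component sums add up to the reported extra_runs sum, and every column covers all 179078 rows.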
total_runs
Real number (ℝ≥0)
High correlation, Zeros
| Distinct | 10 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.313896738 |
| Minimum | 0 |
| Maximum | 10 |
| Zeros | 63002 |
| Zeros (%) | 35.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 1.4 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 1 |
| Q3 | 1 |
| 95-th percentile | 4 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.605421643 |
|---|---|
| Coefficient of variation (CV) | 1.221878095 |
| Kurtosis | 1.640138506 |
| Mean | 1.313896738 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.556932828 |
| Sum | 235290 |
| Variance | 2.57737865 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 73059 | |
| 0 | 63002 | |
| 4 | 20599 | 11.5% |
| 2 | 13125 | 7.3% |
| 6 | 8148 | 4.5% |
| 3 | 688 | 0.4% |
| 5 | 339 | 0.2% |
| 8 | 64 | < 0.1% |
| 7 | 38 | < 0.1% |
| 10 | 16 | < 0.1% |
| Value | Count | Frequency (%) |
| 0 | 63002 | |
| 1 | 73059 | |
| 2 | 13125 | 7.3% |
| 3 | 688 | 0.4% |
| 4 | 20599 | 11.5% |
| 5 | 339 | 0.2% |
| 6 | 8148 | 4.5% |
| 7 | 38 | < 0.1% |
| 8 | 64 | < 0.1% |
| 10 | 16 | < 0.1% |
| Value | Count | Frequency (%) |
| 10 | 16 | < 0.1% |
| 8 | 64 | < 0.1% |
| 7 | 38 | < 0.1% |
| 6 | 8148 | 4.5% |
| 5 | 339 | 0.2% |
| 4 | 20599 | 11.5% |
| 3 | 688 | 0.4% |
| 2 | 13125 | 7.3% |
| 1 | 73059 | |
| 0 | 63002 |
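The total_runs figures can be rebuilt from the value counts above and compared with the other run columns. The sketch below recomputes the row count, sum and mean, and checks the sum against batsman_runs plus extra_runs. That total_runs is the sum of those two columns is an assumption here; it is consistent with the reported sums but not stated in the report.

```python
# Value -> count table copied from the total_runs section.
total_runs_counts = {
    0: 63002, 1: 73059, 2: 13125, 3: 688, 4: 20599,
    5: 339, 6: 8148, 7: 38, 8: 64, 10: 16,
}

# "Sum" rows copied from the batsman_runs and extra_runs sections.
batsman_runs_sum = 223286
extra_runs_sum = 12004

n_rows = sum(total_runs_counts.values())
total_runs_sum = sum(value * count for value, count in total_runs_counts.items())
mean_total_runs = total_runs_sum / n_rows

print("rows:", n_rows)
print("sum:", total_runs_sum)
print(f"mean: {mean_total_runs:.6f}")
print("batsman + extras:", batsman_runs_sum + extra_runs_sum)
```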
player_dismissed
Categorical
| Distinct | 487 |
|---|---|
| Distinct (%) | 5.5% |
| Missing | 170244 |
| Missing (%) | 95.1% |
| Memory size | 1.4 MiB |

| Value | Count |
|---|---|
| SK Raina | 162 |
| RG Sharma | 155 |
| RV Uthappa | 153 |
| V Kohli | 143 |
| S Dhawan | 137 |
| Other values (482) | 8084 |
Length
| Max length | 20 |
|---|---|
| Median length | 9 |
| Mean length | 9.35340729 |
| Min length | 5 |
Unique
| Unique | 80 |
|---|---|
| Unique (%) | 0.9% |
Sample
| 1st row | DA Warner |
|---|---|
| 2nd row | S Dhawan |
| 3rd row | MC Henriques |
| 4th row | Yuvraj Singh |
| 5th row | Mandeep Singh |
Common Values
| Value | Count | Frequency (%) |
| SK Raina | 162 | 0.1% |
| RG Sharma | 155 | 0.1% |
| RV Uthappa | 153 | 0.1% |
| V Kohli | 143 | 0.1% |
| S Dhawan | 137 | 0.1% |
| G Gambhir | 136 | 0.1% |
| KD Karthik | 135 | 0.1% |
| PA Patel | 126 | 0.1% |
| AM Rahane | 116 | 0.1% |
| AT Rayudu | 115 | 0.1% |
| Other values (477) | 7456 | 4.2% |
| (Missing) | 170244 | 95.1% |
Length (histogram not shown)
Words
| Value | Count | Frequency (%) |
| singh | 316 | 1.7% |
| s | 311 | 1.7% |
| v | 261 | 1.4% |
| r | 246 | 1.4% |
| m | 241 | 1.3% |
| sharma | 237 | 1.3% |
| sk | 189 | 1.0% |
| patel | 189 | 1.0% |
| sr | 184 | 1.0% |
| de | 175 | 1.0% |
| Other values (653) | 15744 | 87.0% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | | |
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | | |
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | | |
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | | |
Most frequent character per block
| Distinct | 9 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 170244 |
| Missing (%) | 95.1% |
| Memory size | 1.4 MiB |
| caught | 5348 |
|---|---|
| bowled | 1581 |
| run out | 852 |
| lbw | 540 |
| stumped | 278 |
| Other values (4) | 235 |
Length
| Max length | 21 |
|---|---|
| Median length | 6 |
| Mean length | 6.223341635 |
| Min length | 3 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | caught |
|---|---|
| 2nd row | caught |
| 3rd row | caught |
| 4th row | bowled |
| 5th row | bowled |
Common Values
| Value | Count | Frequency (%) |
| caught | 5348 | 3.0% |
| bowled | 1581 | 0.9% |
| run out | 852 | 0.5% |
| lbw | 540 | 0.3% |
| stumped | 278 | 0.2% |
| caught and bowled | 211 | 0.1% |
| retired hurt | 12 | < 0.1% |
| hit wicket | 10 | < 0.1% |
| obstructing the field | 2 | < 0.1% |
| (Missing) | 170244 | 95.1% |
Length (histogram not shown)
Pie chart (not shown)
Words
| Value | Count | Frequency (%) |
| caught | 5559 | 54.9% |
| bowled | 1792 | 17.7% |
| run | 852 | 8.4% |
| out | 852 | 8.4% |
| lbw | 540 | 5.3% |
| stumped | 278 | 2.7% |
| and | 211 | 2.1% |
| retired | 12 | 0.1% |
| hurt | 12 | 0.1% |
| hit | 10 | 0.1% |
| Other values (4) | 16 | 0.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | | |
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | | |
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | | |
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | | |
Most frequent character per block
| Distinct | 499 |
|---|---|
| Distinct (%) | 7.7% |
| Missing | 172630 |
| Missing (%) | 96.4% |
| Memory size | 1.4 MiB |
| MS Dhoni | 159 |
|---|---|
| KD Karthik | 152 |
| RV Uthappa | 125 |
| SK Raina | 115 |
| AB de Villiers | 114 |
| Other values (494) | 5783 |
Length
| Max length | 21 |
|---|---|
| Median length | 9 |
| Mean length | 9.462779156 |
| Min length | 5 |
Characters and Unicode
| Total characters | 0 |
|---|---|
| Distinct characters | 0 |
| Distinct categories | 0 |
| Distinct scripts | 0 |
| Distinct blocks | 0 |
Unique
| Unique | 93 |
|---|---|
| Unique (%) | 1.4% |
Sample
| 1st row | Mandeep Singh |
|---|---|
| 2nd row | Sachin Baby |
| 3rd row | Sachin Baby |
| 4th row | DA Warner |
| 5th row | BCJ Cutting |
Common Values
| Value | Count | Frequency (%) |
| MS Dhoni | 159 | 0.1% |
| KD Karthik | 152 | 0.1% |
| RV Uthappa | 125 | 0.1% |
| SK Raina | 115 | 0.1% |
| AB de Villiers | 114 | 0.1% |
| PA Patel | 97 | 0.1% |
| RG Sharma | 92 | 0.1% |
| V Kohli | 90 | 0.1% |
| KA Pollard | 85 | < 0.1% |
| WP Saha | 82 | < 0.1% |
| Other values (489) | 5337 | 3.0% |
| (Missing) | 172630 | 96.4% |
Length (histogram not shown)
Words
| Value | Count | Frequency (%) |
| singh | 204 | 1.5% |
| r | 202 | 1.5% |
| s | 198 | 1.5% |
| ms | 194 | 1.5% |
| m | 192 | 1.4% |
| sharma | 188 | 1.4% |
| de | 169 | 1.3% |
| karthik | 166 | 1.2% |
| patel | 164 | 1.2% |
| dhoni | 159 | 1.2% |
| Other values (618) | 11488 | 86.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| No values found. | | |
Most occurring categories
| Value | Count | Frequency (%) |
| No values found. | | |
Most frequent character per category
Most occurring scripts
| Value | Count | Frequency (%) |
| No values found. | | |
Most frequent character per script
Most occurring blocks
| Value | Count | Frequency (%) |
| No values found. | | |
Most frequent character per block
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
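The rank-based definition above can be sketched in plain Python. This is an illustrative implementation, not the code the report itself runs; the function names are hypothetical, and tied values are assumed to receive the average of their rank positions.

```python
from math import sqrt

def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values equal to the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson(x, y):
    """Covariance divided by the product of the standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)

def spearman(x, y):
    """Spearman's rho: Pearson's r applied to the rank variables."""
    return pearson(average_ranks(x), average_ranks(y))

x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]      # monotonic but nonlinear in x
rho = spearman(x, y)
r = pearson(x, y)
print(rho, r)
```

Because `y` increases strictly with `x`, the ranks coincide and ρ is 1 up to floating-point rounding, while r stays below 1 since the relationship is not linear.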
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
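A small sketch of this calculation, including the invariance property, is shown below. The function name and sample data are hypothetical; the example rescales with positive factors only, since multiplying one variable by a negative factor flips the sign of r.

```python
from math import sqrt

def pearson_r(x, y):
    """Sample covariance of x and y divided by the product of their sample standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / (n - 1))
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / (n - 1))
    return cov / (std_x * std_y)

x = [0, 1, 2, 4, 6]
y = [0, 1, 1, 4, 7]

r = pearson_r(x, y)
# Shift and rescale each variable separately (positive scale factors).
r_rescaled = pearson_r([3 * a + 10 for a in x], [0.5 * b - 2 for b in y])
# Negating one variable reverses the direction of the association.
r_negated = pearson_r(x, [-b for b in y])
print(r, r_rescaled, r_negated)
```

Here `r` and `r_rescaled` agree up to rounding, and `r_negated` has the same magnitude with the opposite sign.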
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
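The pair-counting description above corresponds to the simplest variant, often called tau-a. The sketch below follows that description literally; tied pairs count as neither concordant nor discordant, which is an assumption and may differ from the tie-adjusted variants that statistical libraries commonly use. The function name is hypothetical.

```python
from itertools import combinations

def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        # Positive product: both variables order the pair the same way.
        sign = (x[i] - x[j]) * (y[i] - y[j])
        if sign > 0:
            concordant += 1
        elif sign < 0:
            discordant += 1
    return (concordant - discordant) / len(pairs)

x = [1, 2, 3, 4]
y = [1, 3, 2, 4]             # one swapped pair relative to x
tau = kendall_tau_a(x, y)
print(tau)
```

With four observations there are six pairs; five are concordant and one is discordant, so τ = (5 − 1) / 6 ≈ 0.667.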
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for it.
First rows
| | match_id | inning | batting_team | bowling_team | over | ball | batsman | non_striker | bowler | is_super_over | wide_runs | bye_runs | legbye_runs | noball_runs | penalty_runs | batsman_runs | extra_runs | total_runs | player_dismissed | dismissal_kind | fielder |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 1 | 1 | DA Warner | S Dhawan | TS Mills | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | NaN |
| 1 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 1 | 2 | DA Warner | S Dhawan | TS Mills | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | NaN |
| 2 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 1 | 3 | DA Warner | S Dhawan | TS Mills | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 0 | 4 | NaN | NaN | NaN |
| 3 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 1 | 4 | DA Warner | S Dhawan | TS Mills | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | NaN |
| 4 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 1 | 5 | DA Warner | S Dhawan | TS Mills | 0 | 2 | 0 | 0 | 0 | 0 | 0 | 2 | 2 | NaN | NaN | NaN |
| 5 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 1 | 6 | S Dhawan | DA Warner | TS Mills | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | NaN |
| 6 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 1 | 7 | S Dhawan | DA Warner | TS Mills | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 1 | 1 | NaN | NaN | NaN |
| 7 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 2 | 1 | S Dhawan | DA Warner | A Choudhary | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | NaN | NaN | NaN |
| 8 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 2 | 2 | DA Warner | S Dhawan | A Choudhary | 0 | 0 | 0 | 0 | 0 | 0 | 4 | 0 | 4 | NaN | NaN | NaN |
| 9 | 1 | 1 | Sunrisers Hyderabad | Royal Challengers Bangalore | 2 | 3 | DA Warner | S Dhawan | A Choudhary | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 1 | 1 | NaN | NaN | NaN |
Last rows
| | match_id | inning | batting_team | bowling_team | over | ball | batsman | non_striker | bowler | is_super_over | wide_runs | bye_runs | legbye_runs | noball_runs | penalty_runs | batsman_runs | extra_runs | total_runs | player_dismissed | dismissal_kind | fielder |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 179068 | 11415 | 2 | Chennai Super Kings | Mumbai Indians | 19 | 3 | RA Jadeja | SR Watson | JJ Bumrah | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 2 | NaN | NaN | NaN |
| 179069 | 11415 | 2 | Chennai Super Kings | Mumbai Indians | 19 | 4 | RA Jadeja | SR Watson | JJ Bumrah | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | NaN | NaN | NaN |
| 179070 | 11415 | 2 | Chennai Super Kings | Mumbai Indians | 19 | 5 | RA Jadeja | SR Watson | JJ Bumrah | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 2 | NaN | NaN | NaN |
| 179071 | 11415 | 2 | Chennai Super Kings | Mumbai Indians | 19 | 6 | RA Jadeja | SR Watson | JJ Bumrah | 0 | 0 | 4 | 0 | 0 | 0 | 4 | 4 | 8 | NaN | NaN | NaN |
| 179072 | 11415 | 2 | Chennai Super Kings | Mumbai Indians | 20 | 1 | SR Watson | RA Jadeja | SL Malinga | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | NaN | NaN | NaN |
| 179073 | 11415 | 2 | Chennai Super Kings | Mumbai Indians | 20 | 2 | RA Jadeja | SR Watson | SL Malinga | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | NaN | NaN | NaN |
| 179074 | 11415 | 2 | Chennai Super Kings | Mumbai Indians | 20 | 3 | SR Watson | RA Jadeja | SL Malinga | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 2 | NaN | NaN | NaN |
| 179075 | 11415 | 2 | Chennai Super Kings | Mumbai Indians | 20 | 4 | SR Watson | RA Jadeja | SL Malinga | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 1 | SR Watson | run out | KH Pandya |
| 179076 | 11415 | 2 | Chennai Super Kings | Mumbai Indians | 20 | 5 | SN Thakur | RA Jadeja | SL Malinga | 0 | 0 | 0 | 0 | 0 | 0 | 2 | 0 | 2 | NaN | NaN | NaN |
| 179077 | 11415 | 2 | Chennai Super Kings | Mumbai Indians | 20 | 6 | SN Thakur | RA Jadeja | SL Malinga | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | SN Thakur | lbw | NaN |
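The run columns in the sample rows appear to be linked by simple sums, which would also explain the high correlations flagged between them. The sketch below checks two assumed identities on a handful of rows copied from the tables above: `extra_runs` as the sum of the five extras columns, and `total_runs` as `batsman_runs + extra_runs`. The report does not state these identities; they merely hold for the rows shown, and the helper names are hypothetical.

```python
EXTRA_COLUMNS = ("wide_runs", "bye_runs", "legbye_runs",
                 "noball_runs", "penalty_runs")
RUN_COLUMNS = EXTRA_COLUMNS + ("batsman_runs", "extra_runs", "total_runs")

# Row index -> run values in RUN_COLUMNS order, copied from the sample rows.
sample = {
    2:      (0, 0, 0, 0, 0, 4, 0, 4),
    4:      (2, 0, 0, 0, 0, 0, 2, 2),
    6:      (0, 0, 1, 0, 0, 0, 1, 1),
    9:      (0, 0, 0, 1, 0, 0, 1, 1),
    179071: (0, 0, 4, 0, 0, 4, 4, 8),
}
rows = {idx: dict(zip(RUN_COLUMNS, values)) for idx, values in sample.items()}

def is_consistent(row):
    """Check the two assumed identities for a single delivery."""
    extras = sum(row[col] for col in EXTRA_COLUMNS)
    return (extras == row["extra_runs"]
            and row["batsman_runs"] + row["extra_runs"] == row["total_runs"])

inconsistent = [idx for idx, row in rows.items() if not is_consistent(row)]
print(inconsistent)
```

An empty list means every sampled row satisfies both identities. Running the same check over the full dataset would show whether they hold in general.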